# Librispeech fine-tuning
Wav2vec2 Conformer Rope Large 100h Ft
Apache-2.0
Wav2Vec2 Conformer large model with rotary position embeddings, fine-tuned on 100 hours of Librispeech data.
Speech Recognition
Transformers English

facebook
99
0
Data2vec Audio Large 100h
Apache-2.0
Data2Vec is a general self-supervised learning framework applicable to speech, natural language processing, and computer vision tasks. This large audio model was pre-trained and fine-tuned on 100 hours of Librispeech audio data.
Speech Recognition
Transformers English

facebook
46
2
Data2vec Audio Large 10m
Apache-2.0
Data2Vec is a general self-supervised learning framework applicable to speech, vision, and language tasks. This large audio model was pre-trained and fine-tuned on 10 minutes of Librispeech data and is intended for speech audio sampled at 16 kHz.
Speech Recognition
Transformers English

facebook
19
0
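
The Data2vec Audio Large 10m entry above states that the model is meant for 16 kHz speech audio. As a minimal sketch of how one might check a recording before passing it to such a model, the snippet below reads the sample rate from a WAV header using only the Python standard library. The function names and the `EXPECTED_RATE_HZ` constant are hypothetical helpers introduced for illustration; they are not part of the model or of the Transformers library.

```python
import wave

# Sample rate named in the model description; treated here as an assumption
# about what the model expects at inference time.
EXPECTED_RATE_HZ = 16_000


def wav_sample_rate(path):
    """Return the sample rate in Hz stored in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path, expected_rate=EXPECTED_RATE_HZ):
    """Return True when the file's sample rate differs from the expected rate."""
    return wav_sample_rate(path) != expected_rate
```

If `needs_resampling` returns True, the audio would need to be resampled to 16 kHz with an audio library of your choice before transcription.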
Data2vec Audio Base 100h
Apache-2.0
Data2Vec is a general self-supervised learning framework applicable to speech, vision, and language tasks. This base audio model was pre-trained and fine-tuned on 100 hours of Librispeech audio data.
Speech Recognition
Transformers English

facebook
4,369
1